Estimating and Optimizing Performance for Parallel Programs

Author

  • Thomas Fahringer
Abstract

P3T is an interactive performance estimator that assists users in performance tuning of scientific Fortran programs. It detects performance bottlenecks in the program, identifies the causes of performance problems, and guides the user in selecting effective program transformations in order to gain performance. Four of the most critical performance aspects of parallel programs are estimated: load balance, cache locality, communication and computation overhead. P3T is an integrated tool of the Vienna Fortran Compilation System, which enables the estimator to aggressively exploit considerable knowledge about the compiler's analysis information and code restructuring strategies. We evaluate existing features and describe substantial enhancements in three key areas: graphical user interface, performance parameters and input programs. P3T's graphical user interface directs the user to bottlenecks in a computation that prevent the program from performing well. In addition, it allows filtering and visualizing of performance data at various levels of detail. A new performance parameter, which models computation time, is discussed. Furthermore, P3T has been extended to handle a subset of HPF programs. Experiments show the ability of P3T to successfully guide the user in performance tuning of parallel programs. This includes the selection of performance-efficient data distribution strategies and profitable program transformations. The results are very encouraging, and demonstrate the feasibility of automatic performance prediction for parallel programs.


Similar resources

Parallel computation framework for optimizing trailer routes in bulk transportation

We consider a rich tanker trailer routing problem with stochastic transit times for chemicals and liquid bulk orders. A typical route of the tanker trailer comprises sourcing a cleaned and prepped trailer from a pre-wash location, pickup and delivery of chemical orders, cleaning the tanker trailer at a post-wash location after order delivery and prepping for the next order. Unlike traditiona...


Survey of optimizing techniques for parallel programs running on computer clusters

In the current field of high performance computing, cluster technologies play an ever increasing role. This paper tries to summarize state-of-the-art techniques for optimization of parallel programs designed to run on computer clusters. Optimizing parallel programs is a much harder task than optimizing sequential programs due to the increased complexity caused by communication and synchronization...


P3T: An Automatic Performance Estimator for Parallel Programs

The area of parallelizing compilers for distributed memory multicomputers has seen considerable research activity during the last few years. Most of the current compilers do not provide any support for estimating performance impacts of code changes that they apply. In this paper, we present P3T, which is a static and automatic performance estimator for data parallel programs. It computes at c...


Dynamically Adaptive Parallel Programs

Dynamic program optimization is the only recourse for optimizing compilers when machine and program parameters necessary for applying an optimization technique are unknown until runtime. With the movement toward portable parallel programs, facilitated by language standards such as OpenMP, many of the optimizations developed for high-performance machines can no longer be applied prior to runtime...


AN EFFECTIVE HYBRID METHOD FOR OPTIMIZING STEEL FRAMES WITH IMPROVED SEISMIC PERFORMANCE

The main purpose of this paper was to use a combination of Energy-based design method and whale algorithm (WOA), hereinafter referred to as E-WOA, to optimize steel moment frames and improve the seismic performance. In E-WOA, by properly estimating the seismic input energy and determining the optimal mechanism for the structure, steel frames are designed based on the energy balance method; acco...



Journal:
  • IEEE Computer

Volume 28


Publication date: 1995